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A method of inducing a CTL response in an animal which comprises administering to the animal mycobacteria trans- 
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INDUCTION OF CTL RESPONSES TO FOREIGN 
ANTIGENS EXPRESSED IN MYCOBACTERIA 

This invention relates to the induction of a T-cell 
response, in particular a cytotoxic T lymphocyte response. More 
particularly, this invention relates to the induction of CTL 
responses to proteins or polypeptides expressed by recombinant 
mycobacteria. 

Cell-mediated immunity (or CMI) of infections is thought to 
be a major line of defense against certain infections, such as 
viral infections and certain bacterial infections. For example. 
CMI may be significant in the development of an effective vaccine 
against human immunodeficiency virus (HIV) , or AIDS virus, 
because HIV vaccines and/or therapies based on the generation of 
passive transfer of HIV-specific antibody in the absence of 
cell-mediated immunity have not yielded consistent protection in 
primates challenged with the HIV virus. Thus, interest has 
turned to the induction of cell-mediated responses to various 
infections, such as for example, HIV infection, and to the 
identification of proteins or polypeptides that a stimulate a 
cytotoxic T lymphocyte response, and to methods of administering 
such proteins or polypeptides. 

In accordance with an aspect of the present invention, there 
is provided a method of inducing a CTL response in an animal 
comprising administering to the animal mycobacteria transformed 
with at least one DNA sequence which encodes a protein or peptide 
or fragment or derivative thereof which includes an epitope which 
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is recognized by cytotoxic T lymphocytes. The mycobacteria are 
administered in an amount effective to induce a CTL response in 
an animal. 

Proteins or polypeptides for which the at least one DNA 
sequence may encode, include, but are not limited to, 
Mycobacterium leprae antigens; Mycobacterium tuberculosis 
antigens; Rickettsia antigens; Chlamydia antigens; Coxiella 
antigens; malaria sporozoite and merozoite proteins, such as the 
circumsporozoite protein from Plasmodium berohei sporozoites; 
Clostridium antigens; Leishmania antigens; Salmonella antigens; 
Mycobacterium africanum antigens; Mycobacterium intracellular 
antigens; Mycobacterium avium antigens; E.coli antigens; Borrelia 
antigens; Listeria antigens; Franciscella antigens; Yersinia 
antigens; Treponema antigens; Schistostoma antigens; Filaria 
antigens; Pneumococcus antigens; Staphylococcus antigens; Herpe* 
virus antigens; influenza and parainfluenza virus antigens; 
measles virus antigens; mumps virus antigens; hepatitis virus 
antigens; Shigella antigens; Bordatella antigens; Hemophilus 
antigens; Streptococcus antigens; polio virus antigens; Rift 
Valley Fever virus antigens; dengue virus antigens; Human 
Immunodeficiency Virus (HIV) antigens; and respiratory syncytial 
virus (RSV) antigens. 

In one embodiment, the at least one DNA sequence encodes at 
least one protein or polypeptide or fragment or derivative 
thereof which includes an eptiope which is recognized by 
cytotoxic T lymphocytes induced by an HIV protein or fragment or 
derivative'thereof . The at least one DNA sequence may encode an 
HIV protein or fragment or derivative thereof. HIV proteins or 
polypeptides which may be encoded by the at least one DNA 
sequence includes but are not limited to, HIV-I-gp 120; HIV-I-gp 
41; HIV-I-gp 160; HIV-I-pol; HIV-I-nef; HIV-I-tat; HIV-I-rev ; 
HIV-I-vif; HIV-I-vpr; HIV-I-vpu; HIV-I-gag; HIV-2-gp 120; 
HIV-2-gp 160; HIV-2-gp 41; HIV-2-gag; HIV-2-pol; HIV-2-nef; 
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HIV-2-tat; HIV-2-rev; HIV-2-vif; HIV-2-vpr; HIV-2-vpu; and 
HIV-2-vpx. 

Mycobacteria which may be transformed with the at least one 
DNA sequence, which encodes a protein or polypeptide or fragment 
or derivative thereof which includes an epitope which is 
recognized by cytotoxic T lymphocytes, include, but are not 
limited to, Mycobact erium bovia -BCG. M.smeamatia . m. avium . 
M.Phlei , M.fortiutum , MLlufu, M.paratuberculosis , M.habana , 
M.scrofalaceum, and M. intracellulare . In a preferred embodiment, 
the mycobacterium is M.bovis -BCG or a mutant thereof. 

The at least one DNA sequence may be contained within an 
expression vector, which is transformed into a mycobacterium, 
whereby the mycobacterium expresses the protein or polypeptide or 
fragment or derivative thereof which includes an epitope which is 
recognized by cytotoxic T lymphocytes. 

The expression vector may be, for example, a temperate 
shuttle phasmid or a bacterial-mycobacterial shuttle plasmid. 
Each of these vectors may be used to introduce the at least one 
DNA sequence encoding a protein or polypeptide or fragment or 
derivative which includes an epitope which is recognized by 
cytotoxic T lymphocytes, stably into mycobacteria, in which the 
at least one DNA sequence may be expressed. When a shuttle 
phasmid, which replicates as a plasmid in bacteria and a phage in 
mycobactreia, is employed, integration of the phasmid, which 
includes the at least one DNA sequence encoding a protein or 
polypeptide, or fragment or derivative thereof, which includes an 
epitope which is recognized by cytotoxic T lymphocytes, into the 
mycobacterial chromosome occurs through site-specific 
integration. The at least one DNA sequence which encodes a 
protein or polypeptide or fragment or derivative thereof, which 
includes an epitope which is recognized by cytotoxic 
T lymphocytes, is replicated as part of the chromosomal DNA. When 
a bacterial-mycobacterial shuttle plasmid is employed, the at 
least one DNA sequence which encodes a protein or polypeptide or 
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fragment or derivative thereof, which includes an epitope which 
is recognized by cytotoxic T lymphocytes, is stably maintained 
extrachromosomally in a plasmid. Expression of the at least one 
DNA sequence occurs extrachromosomally (e.g., episomally) . For 
example, the at least one DNA sequence is cloned into a shuttle 
plasmid and the plasmid is introduced into a mycobacterium such 
as those hereinabove described, wherein the plasmid replicates 
episomally. Examples of such shuttle phasmids and 
bacterial-mycobacterial shuttle plasmids are further described in 
Application Serial No. 361,944, filed June 5, 1989, which is 
hereby incorporated by reference. 

In one embodiment the mycobacteria are transformed with an 
expression vector which comprises at least one DNA sequence 
encoding a protein or polypeptide which includes an epitope which 
is recognized by cytotoxic T lymphocytes, and a promoter selected 
from the class consisting of mycobacterial promoters and 
mycobacteriophage promoters for controlling expression of the DNA 
encoding the heterologous protein or polypeptide, or fragment or 
derivative thereof, which includes an epitope which is recognized 
by cytotoxic T lymphocytes. 

Mycobacterial and mycobacteriophage promoters which may be 
employed include, but are not limited to, mycobacterial promoters 
such as the BCG HSF60 and HSP70 promoters; the mycobactin 
promoter from M. tuberculosis and BCG; the mycobacterial 14 kda 
and 12 kda antigen promoters; the mycobacterial o-antigen 
promoter from M. tuberculosis or BCG; the MBP-70 promoter, the 
mycobacterial 45 kda antigen promoter from M. tuberculosis or BCG; 
the superoxide dismutase promoter; the mycobacterial asd 
promoter , and mycobacteriophage promoters such as the Bxbl, LI, 
L5, and TM4 promoters. In one embodiment, the promoter is a 
mycobacterial heat shock protein promoter such as HSP60 or HSP70 . 

The promoter sequence may, in one embodiment, be part of an 
expression cassette which also includes a portion of the gene 
normally under the control of the promoter. For example, when a 
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mycobacterial HSP60 or HSP70 promoter is employed, the expression 
cassette may, within the scope of the present invention, include 
in addition to the promoter, a portion of the gene for the HSP60' 
or HSP70 protein. When the expression cassette and the DNA 
encoding the protein or polypeptide, or fragment or derivative 
thereof, which includes an epitope which is recognized by 
cytotoxic T lymphocytes are expressed, the protein expressed by 
the cassette and the DNA encoding the protein or polypeptide is a 
fusion protein of a fragment of a mycobacterial protein (eg the 
HSP60 or HSP70 protein), and the protein or polypeptide or 
fragment or deriviative thereof which includes an epitope which 
is recognized by cytotoxic T lymphocytes. 

In a preferred embodiment, the transcription initiation 
site, the ribosomal binding site, and the start codon, which 
provides for the initiation of the translation of mRNA, are eacft 
of mycobacterial origin. The stop codon, which stops translation 
of mRNA, thereby terminating synthesis of the protein or 
polypeptide or fragment or derivative thereof which includes an 
epitope which is recognized by cytotoxic T lymphocytes, and the 
transcription termination site, may be of mycobacterial origin, 
or of other bacterial origin, or such stop codon and 
transcription termination site may be those of the DNA encoding 
the protein or polypeptide which includes an epitope which is 
recognized by cytotoxic T lymphocytes. 

Preferably, the mycobacterial promoter is a BCG promoter, 
and the mycobacterium is BCG. 

In one embodiment, the expression vector may further include 
DNA which encodes for proteins or polypeptides such as, but not 
limited to, antigens, anti-tumor agents, enzymes, lymphokines, 
pharmacologic agents, immunopotentiators, reporter molecules of 
interest in a diagnostic context, and selectable markers. 

Selectable markers which may be encoded include, .but are not 
limited to, the 8-galactosidase marker, the kanamycin resistance 
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marker, the chloramphenicol resistance marker , the neomycin 
resistance marker, and the hygromycin resistance marker. 

In accordance with one embodiment, the vector further 
includes a mycobacterial origin of replication. 

In accordance with another embodiment, the vector may be a 
plasmid. The plasmid may be a non-shuttle plasmid, or may be a 
shuttle plasmid which further includes a bacterial origin of 
replication such as an E.coli origin of replication, a Bacillus 
origin of replication, a Staphylococcus origin of replication, a 
Streptomvces origin of replication, or a pneumococcal origin of 
replication. In one embodiment, the shuttle plasmid includes an 
E. coli origin of replication. 

In accordance with yet another embodiment, the vector may 
further include a multiple cloning site, and the DNA encoding fpr 
the protein or polypeptide, or fragment or derivative thereof, 
which includes an epitope which is recognized by cytotoxic 
T lymphocytes is inserted in the multiple cloning site. 

In addition to the DNA encoding a heterlogous protein or 
polypeptide, and the mycobacterial promoter for controlling 
expression of the DMA encoding the protein or polypeptide which 
includes an epitope which is recognized by cytotoxic 
T lymphocytes, the expression vector may, in one embodiment, 
further include a DMA sequence encoding bacteriophage integration 
into a mycobacterium chromosome. Bacteriophages from which the 
DNA sequence encoding bacteriophage integration into a 
mycobacterium chromosome may be derived include, but are not 
limited to, mycobacteriophages such as but not limited to, the 
L5, LI, Bxbl, and TM4 mycobacteriophages; the lambda phage of L. 
coli ; the toxin phages of Corvnebacteria ; phages of Actinomycetes 
and Nor card! a ; the *C31 phage of Streptomvces ; and the P22 phage 
of Salmonella . Preferably, the DNA sequence encodes 
mycobacteriophage integration into a mycobacterium chromosome. 
The DNA sequence which encodes bacteriophage integration into a 
mycobacterium chromosome may include DNA which encodes integrase, 
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which is a protein that provides for integration of the vector 
into the mycobacterial chromosome. Preferably, the DNA sequence 
encoding mycobacterial phage integration also includes DNA which 
encodes an attP site. 

The DNA encoding the attP site and the integrase provides 
for an integration event which is referred to as site-specif ic 
integration. DNA containing the attP site and the integrase gene 
is capable of integrating into a corresponding attB site of a 
mycobacterium chromosome. 

It is to be understood that the exact DNA sequence encoding 
the attP site may vary among different phages, and that the exact 
DNA sequence encoding the attB site may vary among different 
mycobacteria. 

Examples of expression vectors which include mycobacterial 
promoters and mycobacteriophage promoters, for controlling the kt 
least one DNA sequence encoding a protein or polypeptide, or 
fragment or derivative thereof, which includes an epitope which 
is recognized by cytotoxic T lymphocytes are further described in 
application Serial No. 642,017, filed January 16, 1991, which is 
a continuation of Application Serial No. 552,828, filed July 16, 
1990, now abandoned. The contents of Application Serial No. 
642,017 are hereby incorporated by reference. 

In another embodiment, the mycobacteria are transformed with 
DNA which comprises a first DNA sequence which is a phage DNA 
portion encoding bacteriophage integration into a mycobacterium 
chromosome, and DNA including the at least one DNA sequence 
encoding a protein or polypeptide, or fragment or derivative 
thereof, which includes an epitope which is recognized by 
cytotoxic T lymphocytes . 

The term "phage DNA portion", as used herein, means that the 
DNA sequence is derived from a phage and lacks the DNA which is 
required for phage replication. 

Bacteriophages from which the phage DNA portion may be 
derived include, but are not limited to, mycobacteriophages, such 
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as but not limited to the those hereinabove described. 
Preferably, the phage DNA portion encodes mycobacteriophage 
integration into a mycobacterium chromosome. 

In a preferred embodiment , the first DNA sequence includes 
DNA encoding integrase, which is a protein that provides for 
integration of the DNA into the mycobacterial chromosome. Most 
preferably, the first DNA sequence also includes DNA which 
encodes an AttP site. 

The DNA sequence encoding the AttP site and the integrase 
provides for an integration event which is referred to as 
site-specific integration- DNA containing the AttP site and the 
integrase gene is capable of integration into a corresponding 
AttB site of a mycobacterium chromosome. 

It is to be understood that the exact DNA sequence encoding 
the attP site may vary among different phages, and that the exact 
DNA sequence encoding the attB site may vary among different 
mycobacteria. 

The integration event results in the formation of two new 
junction sites called AttL and AttR, each of which contain part 
of each .of AttP and AttB. The inserted and integrated non-phage 
DNA which includes the first DNA sequence and the at least one 
DNA sequence, which encodes a protein or polypeptide, or fragment 
or derivative thereof, which includes an epitope which is 
recognized by cytotoxic T lymphocytes is flanked by the AttL and 
AttR sites. The insertion and integration of the phage DNA 
portion results in the formation of a transformed mycobacterium. 

The DHA may further include DNA which encodes a selectable 
marker or Barkers; or other proteins or polypeptides of interest, 
such as, but not limited to anti-tumor agents, enzymes, 
lymphokines, pharmacologic agents, immunopotentiators, and 
reporter molecules of interest in a diagnostic context. 

Selectable markers which may be encoded include, but are not 
limited to, the kanamycin resistance marker, the neomycin 
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resistance marker, the chloroamphenicol resistance marker, and 
the hygromycin resistance marker. 

The phage DNA portion of the present invention, which 
includes the first DNA sequence encoding mycobacterium phage 
integration into a mycobacterium chromosome, and the at least one 
DNA sequence encoding a protein or polypeptide, or fragment or 
derivative thereof, which includes an epitope recognized by 
cytotoxic T lymphocytes, may be constructed through genetic 
engineering techniques known to those skilled in the art. In a 
preferred embodiment, the phage DNA portion may be a plasmid 
including, in addition to the DNA encoding integration and the 
DNA encoding a protein or polypeptide, or fragment or derivative 
thereof, which includes an epitope recognized by cytotoxic 
T lymphocytes, an origin of replication for any of a wide variety 
of organisms, which includes, but is not limited to, E.coli , 
Streptomvcea species, Bacillus species, Staphylococcus species,. 
Shigella species, Salmonella species and various species of 
pneumococci. Most preferably, the plasmid includes an origin of 
replication for E.coli . 

The phage DNA portion also may include a suitable promoter. 
Suitable promoters include, but are not limited to, mycobacterial 
promoters and mycobacteriophage promoters such as those 
hereinabove described. 

The promoter sequence may, in one embodiment, be part of an 
expression cassette which also includes a portion of the gene 
normally under the control of the promoter, as hereinabove 
described. For example, when a mycobacterial HSP60 or HSP70 
promoter is employed, the expression cassette may include, in 
addition to the promoter, a portion of the gene for the HSP60 or 
HSP70 protein. When the expression cassette and the DNA encoding 
the protein or polypeptide, or fragment or derivative thereof, 
which includes an epitope which is recognized by cytotoxic T 
lymphocytes are expressed, the protein expressed by the cassette 
and the DNA encoding the protein or polypeptide is a fusion 
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protein of a fragment of a mycobacterial protein (eg., the HSP60 
or HSP70 protein), and the protein or polypeptide, or fragment or 
derivaitve thereof, which includes an epitope which is recognized 
by cytotoxic T lymphocytes. 

In a preferred embodiment, the transcription initiation 
site, the ribosomal binding site, and the start codon, which 
provides for the initiation of the translation of mRNA, are each 
of mycobacterial origin. The stop codon, which stops translation 
of mRNA, thereby terminating synthesis of the protein or 
polypeptide, or fragment or deriviatve thereof, which includes an 
epitope which is recognized by cytotoxic T lymphocytes, and the 
transcription termination site, may be of mycobacterial origin, 
or of other bacterial origin, or such stop codon and 
transcription termination site may be those of the DNA encoding 
the protein or polypeptide, or fragment or derivative thereof, 
which includes an epitope which is recognized by cytotoxic 
T lymphocytes. 

Examples of DNA which includes a first DNA sequence which is 
a phage DNA portion encoding bacteriophage integration into a 
mycobacterium chromosome, and DNA including the at least one DNA 
sequence encoding a protein or polypeptide, or fragment or 
derivative thereof, which includes an epitope which is recognized 
by cytotoxic T lymphocytes are further described in application 
Serial No. 553,907, filed July 16, 1990, the contents of which 
are hereby incorporated by reference. 

Mycobacteria which are transformed which DNA which encodes 
for a prot«in or polypeptide or fragment(s) or derivative(s) 
thereof, which includes an epitope which is recognized by 
cytotoxic T lymphocytes, may be employed in a composition, such 
as a vaccine, for inducing a CTL response in an aniaml. The 
vaccine may be administered to a human or non-human animal. 

To form such a vaccine, the transformed mycobacteria are 
administered in conjunction with a suitable pharmaceutical 
carrier. As representative examples of suitable carriers there 
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may be mentioned: mineral oil, alum, synthetic polymers, etc. 
Vehicles for vaccines are well known in the art and the selection 
of a suitable vehicle is deemed to be within the scope of those 
skilled in the art from the teachings contained herein. The 
selection of a suitable vehicle is also dependent upon the manner 
in which the vaccine is to be administered. The vaccine may be 
in the form of an injectable dose and may be administered 
intramuscularly, intravenously, orally, intradermally, or by 
subcutaneous administration* 

Other means for administering the vaccine or therapeutic 
agent should be apparent to those skilled in the art from the 
teachings herein; accordingly, the scope of the invention is not 
to be limited to a particular delivery form. 

When the transformed mycobacteria are employed as a vaccine, 
such a vaccine has important advantages over other presently 
available vaccines. Mycobacteria have, as hereinabove indicated, 
adjuvant properties among the best currently known and, 
therefore, stimulate a recipient's immune system to respond with 
great effectiveness. This aspect of the vaccine induces 
cell-mediated immunity and thus is especially useful in providing 
immunity against pathogens in cases where cell-mediated immunity 
appears to be critical for resistance. Also, mycobacteria may 
stimulate long-term memory or immunity. It thus may be possible 
to prime long-lasting T cell memory, which stimulates secondary 
antibody responses neutralizing to the infectious agent. Such 
priming of T cell memory is useful, for example, against 
pertussis, malaria, influenza virus, Herpes virus, rabies, Rift 
Valley fever virus, dengue virus, measles virus, Human 
Immunodeficiency Virus (HIV), and respiratory syncytial virus. 

The invention will now be described with respect to the 
following examples; however, the scope of the present invention 
is not to be limited thereby. 

Example 1 
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A. Construction of plasmid including mycobacterial promoter 
expression cassette and lacZ gene. 
1. Construction of PYUB125 

Plasmid pALSOOO, a plasmid which contains an origin of 
replication of M. fortuitum , and described in Labidi, et al., 
FEMS Microbiol. Lett .. Vol. 30, pgs. 221-225 (1985) and in Gene , 
Vol. 71, pgs. 315-321 (1988), is subjected to a partial Sau 3A 
digest, and 5kb fragments are gel purified. A 5kb fragment is 
then ligated to Bam HI digested pIJ666 (an. E. coli vector 
containing an E. coli origin of replication and also carries 
neomycin-kanamycin resistance, as described in Kieser, et al., 
Gene , Vol. 65, pgs. 83-91 (1988) to form plasmid pYUB12. A 
schematic of the formation of plasmid pYUB12. A schematic of the 
formation of plasmid pYUB12 is shown in Figure 1. pYUB12 and 
pIJ666 were then transformed into M. smegma tis and BCG. 
Neomycin- resistant transformants that were only obtained by 
pYUB12 transformation confirmed that pALSOOO conferred autonomous 
replication to plJ666 in M. smeqmatis and BCG. 

Shotgun mutagenesis by Snapper, et al (1988, hereinabove 
cited) indicated that no more than half of the pALSOOO plasmid 
was necessary to support plasmid replication in BCG. This 
segment presumably carried open reading frames ORF1 and 0RF2, 
identified by Rauzier, et al., Gene , Vol. 71, pgs. 315-321 
(1988), and also presumably carried a mycobacterial origin of 
replication. pYUB12 is then digested with Hpal and EcoRV, a 2586 
bp carrying this region or segment pALSOOO is removed and ligated 
to PvuII digested pYUBS. Plasmid pYUB8 (a pBR322 derivative) 
includes an E. coli replicon and a kan R (aph) gene. Ligation of 
the 2586 bp pYUB12 fragment to PvuII digested pYUB8 results in 
the formation of pYUB53, as depicted in Figure 2. Transformation 
of pYUB53 confirmed that the EcoRV-Hpal fragment, designated 
M.rep, was capable of supporting autonomous replication in BCG. 

Plasmid pYUB53 was then digested with AatI, EcoRV, and PstI 
in order to remove the following restriction sites: 
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AatI 5707 
EcoRI 5783 
BamHI 5791 
Sail 5797 
PstI 5803 
PstI 7252 
Sail 7258 
BamHI 7264 
EcoRI 7273 
Clal 7298 
Hindi I I 7304; and 
EcoRV 7460 

Fragment ends are then flushed with T4 DNA polymerase and 
religated to form plasmid pYUB125, construction of which is shown 
in Figure 3. 

2^ Eliminat ion of superfluous vector DNA from PYUB125 

792 bases of the tet gene, which had been inactivated by 
prior manipulations, was eliminated by a complete Narl digest, 
gel purification of the 6407 bp fragment, and 
ligation/recirculation, transformation of E. coll strain HB101 
and selection of Kan transf ormanta . The construction of 
resulting plasmid, pMVlOl, is schematically indicated in Figure 
4, and the DNA sequence of pMVlOl, which includes markings of 
regions which will be deleted, and of mutations, as hereinafter 
described, is shown in Figure 5. 

3. Construction of expression cassette baaed on BCG HSP60. 

Among the most abundant proteins in mycobacteria is the 
HSP60 heat shock protein (also known as the 65 kda antigen). 
Because abundance of the HSP60 protein in mycobacteria indicates 
strong HSP60 gene expression, the sequence controlling HSP60 
expression was chosen to control expression of heterologous genes 
encoding antigens or other proteins in BCG. 

The published sequence of the BCG HSP60 gene (Thole, et al. 
Infect, and Immun. , Vol. 55, pgs. 1466-1475 (June 1987)), and 
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surrounding sequence permitted the construction of a cassette 
carrying expression control sequences (i.e., promoter and 
translation initiation sequences) by PGR. The BCG HSP61 cassette 
(Figure 6) contains 375 bases 5' to the BCG HSP60 start codon, 
and 15 bases (5 codons) 3' to the start codon. PCR 
oligonucleotide primers were then synthesized. Primer Xba-HSP60, 
of the following sequence: 

CAG ATC TAG ACG GTG ACC ACA ACG CGC C / 
was synthesized for the 5* end of the cassette, and primer 
Bam-HSP61, of the following sequence: 

CTA GGG ATC CGC AAT TGT CTT GGC CAT TG ^ 
was synthesized for the 3 f end of the cassette. The primers were 
used to amplify the cassette by PCR from BCG strain Pasteur 
chromosomal DNA. The addition of the Bam HI site at the 3' end 
of the cassette adds one codon (Asp) to the first six codons of 
the HSP60 gene. 

Each of pMVlOl and the PCR cassette HSP61 was digested with 
Nhel and BamHI. The PCR cassette was then inserted between the 
Nhel and BamHI sites of pMVlOl, then ligated to form plasmid 
pMV65A (Figure 7). 

The' E. coli lac 2 gene (Figure 8) was used as a reporter, or 
marker gene to assay the ability of the HSP61 cassette to express 
heterologous genes in BCG. A BamHI restriction fragment carrying 
the lac 2 gene was cloned into the Bam HI site of Bam HI digested 
pMV65A, resulting in the formation of pMV65A/L2 as indicated 
schematically in Figure 7. The formation of pMV65A/L2 results in 
a fusion between the HSP60 and lac 2 genes at the sixth codon of 
the HSP60 gene and the sixth codon of the lac 2 gene. pMV65A/L2 
was then transformed into E. coli . Blue E. coli colonies were 
selected on x-gal plates for the presence of pMV65A/L2, thus 
indicating that the HSP60 promoter and translation initiation 
sequences were also active in E. coli . 

pMV65A/L2 was then transformed into BCG and plated on Dubos 
Oleic Agar plates containing x-gal. All BCG colonies resulting 
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from this transformation exhibited blue color, thus indicating 
that the lac_Z gene product (B-galactosidase) was expressed in 
BCG. SDS polyacrylamide gel electrophoresis was performed on 
lysates of the pMV65A/LZ BCG recombinants, revealing that 
B-galactosidase protein was expressed to levels in excess of 10% 
of total BCG protein (as determined by staining with Coomassie 
brilliant blue). These data indicated that BCG HSP61 expression 
cassette was functional in expression vector pMV65A. 

Example 2 

Cytotoxic T. lymphocyte res ponse to e coli B-^lactosid»»» 

E> coli B-galactosidase was expressed in BCG as a six amino 
acid fusion protein with BCG hsp 60 protein using 
extrachromosomal plasmid vector pMV65A/L2 utilizing the HSP60 
promoter to drive expression. The recombinant BCG was grown to. 
mid- log phase in Dubos media and concentrated by centrifugation. 
The bacteria were then re-suspended in PBS plus 0.05% Tween 80- 
and cup sonicated briefly to disperse clumped bacteria. Six week 
old BALB/c mice were inoculated with a single dose of 2 X 10 2 , 
2 X 10 , or 2 X 10 6 colony forming units (CFU'S determined 
post-inoculation) by either intradermal (ID), intraperitoneal 
(IP), or intravenous (IV) injection. At 14 or 19 weeks 
post- immunization splenocytes were harvested from mice and CTL 
activity was measured. CTL activity was measured as follows! 

Splenocytes (ACK-treated, 5 X 10 6 /ml) were stimulated in 
vitro in 10 ml in upright T25 flasks by co-culture for 5 days 
with mitomycin C- treated cells transfected with the lac Z gene 
(C3-4 cells; 5 X 10 6 /ml). A 4 hr. 51 Cr release assay was then 
performed in triplicate using P815 and P13.1 cells (P815 ceils 
transfected with the lac Z gene) as targets. Various 
effector-target ratios were tested using 5,000 targets/well. 
Specific lysis was calculated as follows: % specific lysis = 
100 X (release by effector cells minus spontaneous release/ 
maximal release minus spontaneous release]. 
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At 19 weeks post-immunization, the remaining animals in each 
group were boosted by intraperitoneal injection at 10 ng of 
purified lac 2 emulsified in incomplete Freund's adjuvant (IFA). 
Splenocytes were then harvested from these animals at 23 weeks 
and CTL activity was again measured. Unimmunized animals or 
animals immunized with lac Z emulsified in IFA, or animals 
immunized with vaccinia virus expressing lac 2 served as 
controls. 

The results of the avove experiments, as determined by % 
specific lysis of target cells, indicated that a CTL response was 
induced in mice immunized with BCG transformed with the 
expression vector pMV65A/LZ. 

Example 3 

Construction of integrating plasmid including mycobacterial 
promoter expression cassette and HIV-I-op 120 gene . 
1. Elimination of undesirable restriction sites in aph fkan R ). 
gene . 

To facilitate future manipulations, the Hindi I I and Clal 
restriction sites in the aph gene in plasmid pMVlOl were 
mutagenized simultaneously by polymerase chain reaction (PCR) 
mutagenesis according to the procedure described in Gene , Vol. 77 
pgs. 57-59 (1989). The bases changed in the 'aph gene were at the 
third position of codons (wobble bases) within each restriction 
site and the base substitutions made were designed not to change 
the amino acid sequence of the encoded protein. 

Separate PCR reactions of plasmid pMVlOl with primer 
ClaMut-K«n~+ HindRMut-Kan and HindFMut-Kan * Barn-Kan were 
performed at 90°C (1 rain.), 50°C (1 min. ), and 72°C (1 min. ) for 
25 cycle*. The PCR primers had the following base seqeuences: 

ClaMut-Kan 

CTT GTA TGG CAA GCC CC 3 
HindRMut-Kan 

GTG AGA ATG GCA AAA GAT TAT GCA TTT CTT TCC AG f 
HindFMut-Kan 
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GTG TGG AAA GAA ATG CAT AAT CTT TTG CCA TTC TCA CCG G J~ 
Bam-Kan 

CGT AGA GGA TCC AGA GGA CG 6 
The resulting PCR products were gel purified and mixed and a 
single PCR reaction without primers was performed at 94°C (1 
min.), 72°C (1 min. ) for 10 cycles. Primers ClaMut-Kan and 
Bam-Kan were added and PCR was resumed at 94°C (1 min.), 50°C (l 
min.), and 72°C (2 min.) for 20 cylces. The resulting PCR 
product (Kan. mut) was digested with BamHI and gel purified. 
Plasmid pMVlOl was digested with Clal and cohesive ends were 
filled in by Klenow + dCTP + dGTP. Klenow was heat inactivated 
and the digest was further digested with BamHI. The 5232 base 
pair fragment was gel purified and mixed with fragment Kan. mut 
and ligated. The ligation was transformed into E. coli strain 
HB101 and Kan colonies were screened for plasmids resistant to 
Clal and Hindlll digestion. Such plasmids were designated as 
pMVllO, which is depicted in Figure 4. 

2-: Elimination of sequences not necessary for olasmid 

replication in mycobacteria. 

Plasmid pMVllO was resected in separate constructions to 
yield plasmids pMVlll and pMV112. In one construction, pMVllO 
was digested with Narl and Ball, the ends were filled in, and a 
5296 base pair fragment was ligated and recircularized to form 
pMVlll. In another construct, pMVllO was digested with Ndel and 
SplI, the ends were filled in, and a 5763 base pair fragment was 
ligated and recircularized to form pMV112. Schematics of the 
constructions of pMVlll and pMV112 are shown in Figure 9. These 
constructions further eliminated superfluous E. coli vector 
sequences derived from pALSOOO not necessary for mycobacterial 
replication. Cloning was performed in E. coli . Plasmids pMVlll 
and pMV112 were tested for the ability to replicate in fk. 
smegmatis. Because both plasmids replicated in M. smeqmatis the 
deletions of each plasmid were combined to construct pMV113 . 
(Figure 9). 
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To construct pMV113, pMVlll was digested with BamHI and 
EcoRI, and a 1071 bp fragment was isolated. pMV112 was digested 
with BamHI and EcoRI , and a 3570 bp fragment was isolated, and 
then ligated to the 1071 bp fragment obtained from pMVlll to form 
pHV113. These constructions thus defined the region of pALSOOO 
necessary for autonomous replication in mycobacteria as no larger 
than 1910 base pairs. 

3.. Mutagenesis of restriction sites in mycobacterial 

replicon. 

To facilitate further manipulations of the mycobacterial 
replicon, PCR mutagenesis was performed as above to eliminate the 
Sal I, EcoRI, and Bglll sites located in the open reading frame 
known as ORFl of pAL5000. PCR mutagenesis was performed at 
wobble bases within each restriction site and the base 
substitutions were designed net to change the amino acid sequence 
of the putative encoded ORFl protein. The restriction sites were 
eliminated one at a time for testing in mycobacteria. It was 
possible to eliminate the Sail and EcoRI without altering 
replication in M. smeomatls . In one construction PCR mutagenesis 
was performed at EcoRI1071 of pMV113 with primers Eco Mut - M.rep 
and Bam-M.rep to form pMV117, which lacks the EcoRI1071 site. 
Primer Eco Mut - M.rep has the following sequence: 
TCC GTC CAA CGA GTG TCC CGG A; 7 

and Bam-M.rep has the following sequence: 

r 

CAC CCG TCC TGT GGA TCC TCT AC. 0 

In another construction, PCR mutagenesis was performed at 
the Sail 13"89 site with primer Sal Mut - M.rep and Bam-M.rep to 
form pMV119, which lacks the Sail 1389 site. Primer Sal Mut- 
M.rep has the following sequence: 

TOG CGA CCG CAG TTA CTC AGG CCT. f 

PMV117 was then digested with ApaLI and Bglll, and a 3360 bp 
fragment was isolated. pMV119 was digested with ApaLI and Bglll, 
and a 1281 bp fragment was isolated and ligated to the 3360 bp 
fragment isolated from pMV117 to form pMV123. A schematic of the 
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constructions of plasmids pMV117, pMV119, and pMV123 is shown in 
Figure 10. Elimination of the Bglll site, however, either by PCR 
mutagenesis or Klenow fill in, eliminated plasmid replication in 
mycobacteria, thus suggesting that the Bglll site is in proximity 
to, or within a sequence necessary for mycobacteria plasmid 
replication. 

4. Construction of DMV200 series vectora. 

To facilitate manipulations of all the components necessary 
for plasmid replication in E. coli and mycobacteria, (E. rep. and 
M. rep.) and selection of recombinants (Kan R ), cassettes of each 
component were constructed for simplified assembly in future 
vectors and to include a multiple cloning site (MCS) containing 
unique restriction sites and transcription and translation 
terminators. The cassettes were constructed to allow directional 
cloning and assembly into a plasmid where all transcription is 
unidirectional. 

Kan R Cassette 

A ONA cassette containing the aph (Kan R ) gene was 
constructed by PCR with primers Kan5* and Kan3'. An Spel site 
was added to the 5' end of the PCR primer Kan3', resulting in the 
formation of a PCR primer having the following sequence: 
CTC GAC TAG TOA GGT CTG CCT CGT GAA G. fO 
Bam HI + Nhel sites were added to the 5' end of the primer 
Kan5 ' , resulting in the formation of a PCR primer having the 
following sequence: 

CAG AGG ATC CTT AGC TAG CCA CT GAC GTC GGG G. // 
PCR was performed at bases 3375 and 4585 of pMV123, and 
BamHI and Nhel sites were added at base 3159, and an Spel site 
was added at base 4585. Digestion with BamHI and Spel, followed 
by purification resulted in a 1228/2443 Kan R cassette bounded by 
BamHI and Spel cohesive ends with the direction of transcription 
for the aph gene proceeding from BamHI to Spe I. 
E . rep . cassette 
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A DNA cassette containing the ColEI replicon of pUC19 was 
constructed by PCR with primers E.rep/Spe and E.rep/Mlu. An Spel 
site was added to the 5' end of PCR primer E.rep/Spe and an Mlul 
site was added to the 5' end of PCR primer E.rep./Mlu. The 
resulting primers had the following sequences: 

E.rep./Spe 

CCA CTA GTT CCA CTC AGC CTC AGA CCC /-? 
E.rep./Mlu 

GAC AAC GCG TTG CGC TCG GTC GTT CGG CTG. 1 3 
PCR was performed at bases 713 and 1500 of pUC19, and an 
Mlul site was added to base 713, and a Spel site was added to 
base 1500. Digestion with Mlul and Spel, followed by 
purification resulted in an E.rep. cassette bounded by Spel and 
Mlul cohesive ends with the direction of transcription for RNA I 
and RNA II replication primers proceeding from Spel to Mlul. 
M.rep. cassette 

A DNA cassette containing sequences necessary for plasmid 
replication in mycobacteria was constructed by PCR of pMV123 with 
primers M.rep/Mlu and M. rep/Bam. An Mlul site was added to the 
5' end of PCR primer M.rep/Mlu. A BamHI site was added to the 5* 
end of PCR primer M. rep/Bam. The resulting PCR primers had the 
following base sequences: 

M.reo. /Mlu 

CCA TAC GCG TGA GCC CAC CAG CTC CG 
M.rep. /Bam 

CAC CCG TCC TGT GGA TCC TCT AC 

PCR was performed at bases 134 and 2082 of pMV123 . An Mlul 
sited was added to base 2082. Digestion with BamHI and Mlul, 
followed by gel purification resulted in a 1935 base pair DNA 
cassette bounded by Mlul and BamHI cohesive ends with the 
direction of transcription for the pALSOOO 0RF1 and ORF2 genes 
proceeding from Mlul to Bam HI. 

The Kan R , E.rep, and M.rep PCR cassettes were then mixed in 
equimolar concentrations and ligated, and then transformed in 
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coli strain HB101 for selection of Kan R transformants . Colonies 
were screened for the presence of plasmids carrying all three 
cassettes after digestion with BamHI + Hlul + Spel and designated 
PMV200. An additional restriction site, Ncol, was eliminated 
from the M.rep cassette by digestion of pMV200 with Ncol, fill in 
with Klenow, and ligation and recircularization, resulting in the 
formation of pMV201. A schematic of the formation of pMV200 from 
PMV123 and pUC19, and of pMV201 from pMV200, is shown in Figure 
11. Plasmids pMV200 and pMV201 were transformed into 
smeqmatis and BCG. Both plasmids yielded Kan R transformants, 
thus indicating their ability to replicate in mycobacteria. 

A synthetic multiple cloning sequence (MCS) (Figure 12) was 
then designed and synthesized to facilitate versatile molecular 
cloning and manipulations for foreign gene expressions in 
mycobacteria, and for integration into the mycobacterial 
chromosome. The synthetic MCS, shown in Figure 12, contains 16 
restriction sites unique to pMV201 and includes a region carrying 
translation stop codons in each of three reading frames, and a 
transcription terminator derived from E. coli 5S ribosomal RNA 
(Tl). 

To insert the MCS cassette, pMV201 was digested with Narl 
and Nhel, and the resulting fragment was gel purified. The MCS 
was digested with HinPI and Nhel and, the resulting fragment was 
gel purified. The two fragments were then ligated to yield 
pMV204. A schematic of the construction of pMV204 is shown in 
Figure 13. _ 

Plasmid pMV204 was then further manipulated to facilitate 
removal of the M.rep cassette in further constructions. pMV204 
was digested with Mlul, and an Mlul - Not I linker was inserted 
into the Mlul site between the M.rep and the E.rep to generate 
pMV206. A schematic of the construction of pMV206 from pMV204 is 
shown in Figure 14, and the DNA sequence of pMV206 is given in 
Figure 15. 

5. Construction of expression cassette baaed on BCG HSP60. 
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The HSP61 cassette (Figure 6) was constructed as hereinabove 
described in Example 1. 

Each of pMV206 and the PCR cassette HSP61 was digested with 
Xbal and BamHI. The PCR cassette was then inserted between the 
Xbal and BamHI sites of pMV206, then ligated to form plasmid 
pMV261. The construction of this plasmid is shown schematically 
in Figure 17. The reading frame and the restriction sites of the 
multiple cloning site of pMV261 is shown in Figure 16, 

The E. coli lac Z gene was used as a reporter, or marker 
gene to assay the ability of the HSP61 cassette to express 
heterologous genes in BCG. A BamHI restriction fragment carrying 
the lac Z gene was cloned into the Bam HI site of Bam HI digested 
pMV261, resulting in the formation of pMV261/LZ. A schematic of 
the construction of pMV261/LZ is shown in Figure 18. The 
formation of pMV261/LZ results in a fusion between the HSP60 ane 
lac Z genes at the sixth codon of the HSP60 gene and the sixth . 
codon of the lac Z gene. pMV261/LZ was then transformed into E^ 
coli- Blue E. coli colonies were selected on x-gal plates for 
the presence of pMV261/LZ, thus indicating that the HSP60 
promoter and translation initiation sequences were also active in 
E. coli . 

pMV261/LZ was then transformed into BCG and plated on Dubos 
Oleic Agar plates containing x-gal. All BCG colonies resulting 
from this transformation exhibited blue color, thus indicating 
that the lac Z gene product (B-galactosidase) was expressed in 
BCG. SDS polyacrylamide gel electrophoresis was performed on 
lysates of the pKV261/LZ BCG recobinants, revealing that 
B-galactosida«e protein was expressed to levels in excess of 10% 
of total BCG protein (as determined by staining with Coomassie 
brilliant blue). These data indicated that BCG HSP61 expression 
cassette was functional in expression vector pMV261. 

Plasmid pMV261/LZ was then shown to replicate autonomously, 
and express the E. coli B-galactosidase, or lacZ gene, driven by 
the BCG promoter HSP60, in m. smeamatis and BCG. 
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^ Transfer of my cobacteriophage L5 iiiteg rafeion sequences 

to BCG expression vector. 

Plasmid pMH9.4, which includes the mycobacteriophage L5 attP 
site, and the L5 integrase gene, was employed in providing the L5 
integration sequences to a BCG expression vector. The 
construction of pMH9.4, as well as its integration into VL 
smegmatis and BCG, is described below in sections (i) through 
(vi). 

< 4 > Identification of the DNA sequ ences of the attachment sites 
attB, attL, and at tR. of M. smeomatis. 

Using standard technologies, a lambda EMBL3 library was 
constructed using chromosomal DNA prepared from mc 2 61 (a strain 
of M. smegmatis which includes an M. ameomat^ chromosome into 
which has been integrated the genome of mycobacterial phage L%) 
and digested with Bam HI. Phage L5 contains DNA having 
restriction sites identical to those of phage LI (Snapper, et al. 
1988), except that L5 is able to replicate at 42°C and phage LI 
is incapable of such growth. This library was then probed with a 
6.7 kb DNA fragment isolated from the L5 genome that had been 
previously identified as carrying the attP sequence (Snapper, et 
al 1988). One of the positive clones was plaque purified, DNA 
prepared, and a 1.1 kb Sal I fragment (containing the AttL 
sequence) sub-cloned into sequencing vector pUC119. The DNA 
sequence of this fragment was determined using a shotgun approach 
coupled with Sanger sequencing. By isolating and sequencing the 
attL junction site and comparing this to the DNA sequence of L5 
that was available, a region was determined where the two 
sequences aligned but with a specific discontinuity present. The 
discontinuity represents one side of a core sequence, which is 
identical in AttP, attB, and attL. The region containing the 
recombinational crossover point is shown in Figure 19. 

The attL DNA (1.1 kb Sal I fragment) was used as a probe to 

2 

hybridize to a Southern blot of Bam HI digested me 6 DNA, which 
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is a strain of M. smeqmatis which includes an M. smecmtatis 
chromosome without any phage integration (Jacobs, et al, 1987, 
hereinabove cited.)- A single band of approximately 6.4 kb was 
detected corresponding to the attB sequence of M. smeqmatis . 
This same attL probe was used to screen a cosmid library of mc 2 6 
(provided by Dr. Bill Jacobs of the Albert Einstein College of 
Medicine of Yeshiva University), and a number of positive cosmid 
clones were identified. DNA was prepared from these clones, and 
a 1.9 kb Sal I fragment (containing the attB site) that 
hybridizes to the attL probe was subcloned into pUC119 for 
sequencing and further analysis. The DNA sequence containing the 
core sequence was determined and is shown in Figure 19. The core 
sequence, which is identical in attP, attB and attL, has a length 
of 43bp. 

The mc 61 lambda EMBL3 library was then probed with the 
1.9kb Sail fragment containing the attB site. Positive plagues 
were identified, DNA was prepared, and analyzed by restriction 
analysis and Southern blots. Lambda clones were identified that 
contained a 3.2kb Bam HI fragment containing the putative attR 
site. The 3.2kb Bam HI fragment was purified and cloned into 
pUC119 for sequencing and further analysis. 

fii) Determination of a ttP- integrase regio n of L5 genome. 

Concurrent with the above procedures, a significant 
portion of the DNA sequence of L5 had been determined and 
represented in several "contigs" or islands of DNA sequence. 
Sequences of the 6.7kb Bam HI fragment hereinabove described were 
determined" by (a) analysis of the location of Bam HI sites in the 
contigs of the DNA of L5, and (b) by determining a short stretch 
of DNA sequence from around the Bam HI sites of plasmid pJR-1 
(Figure 24), which carries the 6.7kb Bam HI fragment of L5. 

A segment of DNA sequence was located that represented the 
6.7kb Bam HI fragment of phage LS. Studies of other phages have 
shown that the integrase genes are often located close to the 
attP site. It was thus determined that the L5 integrase (int) 
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gene should lie either within the 6.7kb Bam HI fragment or in a 
DNA sequence on either side of it. The DNA sequence in the 
regions was then analyzed by translating it into all six possible 
reading frames and searching these amino acid sequences for 
similarity to the family of integrase related proteins, and 
through computer-assisted analysis of the DNA sequence. As shown 
in Figure 20, there are shown two domains of reasonably good 
conservation among L5 integrase and other integrases, and three 
amino acid residues that are absolutely conserved in domain 2. 
(See Yagil, et al., J. Mol. Biol. . Vol. 207, pgs. 695-717 (1989), 
and Poyart-Salmeron, et al., J. EMBO. . Vol. 8, pgs. 2425-2433 
(1989)). A region was identified, and analysis of the 
corresponding DNA sequence showed a reading frame that could 
encode for a protein of approximately 333 amino acids. These 
observations identified the putative int gene. 

The location of the int gene was not within the 6.7kb Bam .HI 
fragment; however, it was very close to it with one of the Bam HI 
sites (that defines the 6.7kb Bam HI fragment) less than 100 bp 
upstream of the start of the gene. Analysis of the Bam HI sites 
showed that the int gene lay within a 1.9kb Bam HI fragment 
located adjacent to the 6.7kb Bam HI fragment. This 1.9kb Bam HI 
fragment was cloned by purification of the fragment from a Bam HI 
digest of L5 DNA and cloning into pUC 119, to generate pMHl 
(Figure 25). 

From a combination of the above approaches, a schematic of 
the organization of the attP-int region of L5 was constructed 
(Figure 26), and the gene sequence of the attP-int region is 
given in Figure 22. 

(iii ) Construction of pMH5 . 

The 6.7kb Bam HI fragment of mycobacteriophage L5, which 
contains the attP site, as hereinabove described, was cloned into 
the Bam HI site of pUC 119 (Figure 23). This was achieved by 
purifying the 6.7kb Bam HI fragment from a Bam HI digest of L5 
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DNA separated by agarose gel electrophoresis and ligating with 
Bam HI cut pUC 119. DNA was prepared from candidate recombinants 
and characterized by restriction enzyme analysis and gel 
electrophoresis. A recombinant was identified that contained the 
6.7kb Bam HI fragment of L5 cloned into pUC 119. This plasmid 
was named pJR-1, as shown in Figure 24. 

Analysis of DNA sequence data from a project to sequence L5 
showed that a 1.9kb Bam HI fragment adjacent to the 6.7kb Bam HI 
fragment hereinabove described contained the integrase gene. 

A plasmid containing a 1.9kb Bam HI fragment containing the 
DNA encoding for the integrase cloned into the Bam HI site of pUC 
119 was constructed. The 1.9kb fragment was purified from a Bam 
HI digest of L5 DNA and cloned into the Bam HI site of pUC 119. 
Construction of the recombinant was determined by restriction 
analysis and gel electrophoresis. This plasmid was called pffifl, 
the construction of which is shown schematically in Figure 25. 

pJR-1 was then modified by digestion with EcoRI and SnaBI 
(both are unique cloning sites), between which is a Bam HI site. 
The EcoRI-SnaBI fragment, including the Bam HI site was excised, 
and the, plasmid was religated to form plasmid of pMH2, which 
contains on Bam HI site compared to two Bam HI sites contained in 
pJR-1. A schematic of the construction of pMH2 is shown in 
Figure 26. 

The 1.9kb Bam HI fragment, which includes the integrase 
gene, was purified from a Bam HI digest of pMHl and ligated to 
Bam HI digested pMH2. Recombinants were identified as above and 
the orientation of the 1.9kb fragment determined. A plasmid 
called pMH4 was thus constructed (Figure 27) in which the region 
from the Sna BI site (upstream of attP) through to the Bam HI 
site (downstream of the integrase gene) was identical to that in 
L5. 

pMH4 was digested with Hindi I I (unique site) and was ligated 
to a Ikb Hindlll fragment purified from pKD43 (supplied by Keith 
Darbyshire of the Nigel Gindley Laboratory) that contains the 
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gene determir. ig resistance to kanamycm. .RecoiHbinantjs ,wer* 
identified arid characterized as above. This- 'plasmid is e'ali^a 
pMH5. A schematic of the construction of pMH5 is shown in Figure 
28. 

fiv) Integration of pMH5 into attB of M. smecnnatis. 

Plasmids pYUB12 (a gift from Dr. Bill Jacobs, a schematic of 
the formation of which is shown in Figure 1), pMDOl (Figure 29), 
and pMH5 were electroporated, with four different concentrations 
of plasmid DNA over a 1,000-fold range, into M. smecnnatis strain 
mc 2 155, a strain which is able to support plasmid replication. 
In sections (iv) through (vi), all electroporation procedures of 
M. smecnnatis , or of BCG, were carried out as follows: 

Cultures of organism were grown in Middlebrook 7H9 media, as 
described by Snapper, et al. (1988), harvested by centrifugation, 
washed three times with cold 10% glycerol, and resuspended at 
approximately a 100 x concentration of cells. 

1 Ml of DNA was added to 100 \xl of cells in an ice-cold 
cuvette and pulsed in a Bio-Rad Gene Pulser, and given a singTe 
pulse at 1.25 lev at 25 |iF. 1 ml of broth was added the cells 
incubated for 1 hr. at 37°C for expression of the antibiotic- 
resistant marker. Cells were then concentrated and plated out on 
Middlebrook or tryptic soy media containing 15 |ig/ml kanamycin. 
Colonies were observed after 3 to 5 days incubation at 37 °C. 

Each of pYUB12, pMDOl, and pMH5 carries kanamycin 
resistance. Plasmid pYUB12 carries an origin of DNA replication, 
while pMDOl lacks a mycobacterial origin of replication. Plasmid 
pNH5 does not carry a mycobacterial origin of replication, but 
carries a 2kb region of phage L5 which contains the attP site and 
the integrase gene (Figure 22). The number of transformants were 

linear with DNA concentration. Plasmid pYUB12 gives a large 

5 2 
number of transformants (2 x 10 per \ig DNA) in mc 155, while 

pMHS gives 6 x 10 4 transformants per pg DNA, and pMDOl gives no 
transformants . 
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The above experiment was then repeated by electroporating 

the plasmids pYUB12, pMDOl, and pMH5 into H. smegma tis strain 

mc 2 6, which does not support plasmid replication. No 

transformants in mc 2 6 were obtained from pYUB12 or pMDOl, while 

4 

pMH5 gave approximately 10 kanamycin resistant transformants in 
mc 2 6 per ug of DNA, thus indicating integration of pMH5 into the 

2 

mc 6 chromosome. 

2 

DNA from six independent pMH5 transformants (four in mc 155 
and two in mc 2 6) was prepared. These DNA' s (along with DNA from 
both mc 2 155 itself, and mc 2 155 carrying the plasmid pYUB12) were 
digested with a restriction enzyme, and analyzed by Southern blot 
and hybridization with the M. smecrmatis 1.9kb attB probe 
hereinabove described. As shown in Figure 30, all six 
transformants have integrated into the attB site, resulting in 
the production of two new DNA fragments with different 
mobilities. If pMH5 did not integrate into the attB site, it 
would be expected that a single band, corresponding to the attB 
site in the mc 2 155 control, would be obtained, 
(v) Construction of PMH9.2 an d DMH9.4 

pUC119 was digested with Hindlll, and a lkb Hindi 1 1 
fragment, containing a kanamycin resistance gene, purified from 
pKD43, was ligated to the Hindi I I digested pUC119 to form pMHS 
(Figure 31). A 2kb Sail fragment (bp 3226-5310), which carries 
the attP and integrase gene from Sail digested pMH5, was purified 
and inserted in both orientations relative to the vector backbone 
of Sail digested pMH8 to form plasmids pHH9.2 and pHH9.4 (Figures 
32 and 33). 



M. 



tecmatis strain mc 2 155 cells carrying, as a result of 



2, 



electroporation, plasmid pWB 12, pMH9.2 or pMH9.4, or strain mc 6 
cells carrying plasmid pHH5, as a result of electroporation as 
hereinabove described, were grown to saturation in broth with 
kananmycin. Cultures were then diluted 1:100 into broth without 
kanamycin and grown to saturation. Two further cycles of 
dilution and growth were done, corresponding to about 20 
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generations of bacterial growth. Cultures were plated out to 
single colonies on non-selective plates, and approximately 100 of 
these colonies were patch plated onto both non-selective and 
selective plates. The % of colonies that were sensitive to 
kanamycin, thus corresponding to the percentage of cells which 
lost the plasmid, is given below in Table I. 

Table I 

% loss 



pYUB12 (mc*155) 35 

pMH5 (mc 2 6) 17 

PMH9.2 (mc 2 155) 3 

pMH9.4 (mc 2 155) 0 



(vi) Transformation of BCG with DMH9.4 

The 1.9 kb Sal I fragment, which includes the N. smeomatis 
attB site as hereinabove described was cloned into pUC119, afid 
the plasmid generated was named pMH-12. (Figure 34). 

Gel purified Sal I 1.9kb M. smeomatis fragment containing 
attB (isolated from pMH-12) was used to probe a Southern transfer 
of Bam HI digested mycobacterial DNA's, including BCG substrain 
Pasteur, shown in Figure 35. This demonstrated that there is one 
Bam HI fragment of BCG that strongly hybridizes to the 
smeomatis attB probe and three hybridize weakly. The strongest 
hybridizing band is the fastest moving band (approximately 1.9 
kb). 

The same probe as above was used to probe a BCG cosmid 
library (provided by Dr. Bill Jacobs) and positive clones were 
identified. DNA was prepared from several positive clones and 
analyzed by restriction analysis and Southern blotting. The 1.9 
kb Bam HI fragment (corresponding to the strongly hybridizing 
band in the Southern blot was identified, gel purified from the 
cosmid DNA and cloned into pUC119. The resulting plasmid was 
named pMH-15. (Figure 36). 

Plasmid pMH-5 and pMH9.4 were elect roporated into BCG 
Pasteur. It was observed that pMH9.4 transforms BCG with high 



SUBSTITUTE SHEET 



WO 92/21376 



PCT/US92/04538 

-30- 



efficiency (approximately 10 transformants/ng DNA) , while pMH-5 
transforms BCG at low efficiency (1-10 transformants/ug DNA). 
DNA was prepared from BCG transformants and analyzed by Bam HI 
restriction and Southern blot analysis, probing with gel purified 
1.9kb Bam HI BCG attB fragment from pMH-15. These data are shown 
in Figure 37 and show that integration of both pMH5 and pMH9.4 is 
specific to the BCG attB site (ie. the strongly cross-hybridizing 
fragment in BCG). This is illustrated by the loss of the 1.9kb 
Bam HI fragment from the transformants and the appearance of two 
new bands representing attL and attR junction fragments. Figure 
37 shows just one of the pMH5/BCG transformants, although all of 
the four that were analyzed show that one of the bands (the 
largest) is smaller than expected (and different in each of the 
transformants ), indicating that the transformation effiency of 
pMH-5 is low in BCG. In contrast, the four pMH9.4 transformants 
are identical to each other (Figure 37) and give attR and attL 
junction fragments of the predicted sizes. 

Plasmid pMH9.4, which includes the mycobacterial phage L5 
attP site and the L5 integrase gene, was digested to completion 
with either Kpnl + PvuII or Xbal ♦ PvuII, and a restriction 
fragment of 1862 or 1847 base pairs, respectively, each of which 
contain the attP site and the integrase gene, were purified by 
agarose gel electrophoresis. Plasmid pMV261/L2 was digested with 
Xbal or Oral to generate either a 7569 bp or 7574 bp vector 
fragment. The 7569 bp fragment was ligated to the 1862 bp 
fragment derived from pMH9.4 to form pMV460/L2. The 7574 bp 
fragment was ligated to the 1847 bp fragment derived from pMH9.4 
to form pMV460 R/LZ. Plasmids pHV460 F/LZ and pMV460R/LZ each 
include a mycobacterial replicon, the L5 attP site, and the L5 
integrase gene. A schematic of the formation of plasmids pMV460 
F/LZ and pMV460R/LZ is shown in Figure 38. To generate 
derivatives without the mycobacterial plasmid replicon, plasmids 
pHV460/LZ and pMV460/LZ were digested with Not I and 
recircularized by ligation to generate pMV360F/LZ and pMV360R/LZ. 
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A schematic of the construction of pMV360F/LZ and pMV360R/LZ is 
shown in Figure 39. 

Plasmids pMH9.4, pMV261/LZ, pMV460/LZ, pMV460/LZ, pMV460/LZ, 
and pMV460/LZ were then transformed into M. gmegmatis and BCG to 
test their ability to replicate autonomously or integrate into 
the H. — smegmatis or the BCG chromosome. Transformation with 
PMH9.4, pMV261/LZ, pMV360F/LZ, and pMV360R/LZ yielded kananmycin 
resistant transformants of M. amegmatis and BCG. Transformants 
of pMV261LZ, pMV360F/LZ, and pMV360R/LZ were shown to express 
coll B-galactosidase by SDS-polyacrylamide gel electrophoresis 
and X-gal assay. Plasmids pMV460F/LZ and pMV460R/LZ failed to 
yield kanamycin resistant transformants, thus indicating that 
chromosomal integration of a plasmid carrying sequences mediating 
autonomous replication is lethal to mycobacteria. 

7. Construction of PMV307. 

Plasmid pMV206 was digested with NotI to remove the 
mycobacterial replicon. The resulting 2209 bp fragment, which 
includes the aph (Kan R ) gene, the E. coli replicon and the 
multiple cloning site, was ligated and recircularized to form 
PMV205, the construction of which is schematically depicted in 
Figure 14. 

PCR with primers Xbal-Att/Int and Nhel-Att/Int was then 
performed on a Sal I fragment from pMH9.4, which contains the 
attP site and the L5 integrase gene. The resulting cassette was 
then digested with Xbal and Nhel and a 1769 bp fragment was gel 
purified. pMV205 was then digested with Nhel, and the resulting 
fragment was ligated to the 1989 bp fragment obtained from pMH9.4 
to form pMV307. A schematic of the construction of pMV307 is 
shown in Figure 40. 

8. Construction of pMV261/HIV1-op 120 . 

An Smal-Clal antigen gene fragment, or cassette, was 
constructed by PCR, and cloned between the Bam HI and Clal 
restriction sites of pMV261 to form pMV261/HIVl-gp 120. 
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Plasmid pMV261/HIVl-gp 120 was transformed into BCG, and the 
presence of the corresponding antigen in BCG was verified by the 
appearance of immunoreactive protein bands in Western blot 
analysis of BCG recombinant lysates. 
9. Construction of pMV361/HIVl-gp 120 

The HIVl-gp 120 antigen gene expression cassette, which 
includes a promoter sequence and an HIVl-gp 120 gene sequence, 
was excised from the pMV261 derivatives with Not I and a second 
restriction enzyme site (Pvu II, Eco RI, Sal I, Cla I or Hind 
III) and cloned into the integrating plasmid pMV307 between the 
NotI site and a second enzyme site (Pvu II, Eco RI, Sal I, Cla I 
or Hind III) to form the plasmid pMV361/HIV-Igpl20. The backbone 
of this plasmid is shown in Figure 41. 

Plasmid pMV361/HIV-Igpl20 was transformed into BCG and shown 
to express the corresponding antigens by Western blot analysis 
(Figure 42) with the appropriate anti gen- specif ic human sera. 

Example 4 

Cytotoxic T lymphocyte responses to HIV-l-gp 120 

HIV-1 gp 120 was expressed in BCG as a six amino acid fusion 
protein with BCG hsp 60 protein using vector pMV361/HIV-l-gpl20, 
using the hsp 60 promoter to control expression. 

Two groups of mice were inoculated with 1 X 10 6 CFU's of 
recombinant BCG expressing the gp 120 gene from the integrative 
plasmid pMV361/HIV-l-gpl20. One group received the BCG via 
intraperi tonal injection (100 ul) whereas the other group 
received the BCG by deposition of the dose (10 ul) rubbed into a 
tail scratch. 

CTL activity was measured at various times after 
immunization. CTL activity was measured as follows: 

Two mice from each group were sacrificed at various times 
after immunation, and the spleens were removed. Single cell 
suspensions were made and the red blood cells were lyzed with 
ammonium chloride. The cells were stimulated in vitro for 5 days 
with P815 cells- that were pulsed with peptide P18, a fifteen 



SUBSTITUTE SHEET 



WO 92/21376 



PCT/US92/04538 



-33- 

residue synthetic peptide within HIVl-gp 120. A 4-hour chromium 
(Cr ) release assay was then carried out using untreated P815 
and peptide P18 pulsed P815 cells as targets. Significant 
P18-specific CTL activity was observed in the mice immunized by 
tail scratch 14 weeks after immunization. At 16 weeks, CTL 
activity was observed in both groups of mice. Upon repeat of 
this experiement, CTL activity was observed at time points as 
early as 8 weeks after immunization. 

Example 5 

Recombinant BCG transformed with pMV361/HIVl-gp 120 were 
grown to mid- log phase in Dubos media and concentrated by 
centrifugation. The bacteria were then resuspended in 15% 
glycerol and frozen using a rate contolled freezing apparatus. 
The bacteria were stored at -70°C until use (referred to as 
"vaccine"). A second preparation grown in the same way was not 
frozen and is referred to as a "fresh" preparation. Prior to 
immunization of animals, the bacteria were resuspended in PBS + 
0.05% Tween 80 to the desired concentration and cup sonicated 
briefly to disperse clumped bacteria. Six week old BALB/c mice 
were inoculated with a single dose of 5 x 10 4 cfu fresh bacteria 
(determined post inoculation) or 1.5 x 10 5 frozen bacteria 
(determined pre-inoculation) by tail scratch (t.s.) injection. 
At 8 weeks post-immunization, splenocytes were harvested from 
animals and CTL activity was measured (described below) . 
Splenocytes from unimmunized animals were used as controls in the 
CTL assays. 

CTL activity was determined as follows: 

Splenocytes (ACK-treated, 5 X 10 6 /ml) were stimulated in 
vitro in 10 ml in upright T25 flasks by co-culture for 5 days 
with mitomycin C-treated P815 cells (5 X 10 6 /ml) that were pulsed 
with 250 ug/ml of peptide P18 for one hour. A 4 hr. 51 Cr release 
assay was subsequently performed in triplicate using P815 targets 
with or without pulsing for 1 hour with 250 ug/ml peptide P18. 
Various effector- target ratios were tested using 5000 
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targets/well . Specific lysis was calculated as follows: % 
specific lysis = 100 X [release by effector cells minus 
spontaneous release/maximal release minus spontaneious release] . 
The results are given in Figure 42. 

As shown in Figure 43, both groups of mice showed an 
increased CTL response at 8 weeks after immunization as compared 
with unimmunized mice. 

Example 6 

Recombinant BCG transformed with pMV361/HIVl-gp 120. were 
grown to mid- log phase in Dubos media and concentrated by 
centrifugation. The bacteria were then resuspended in 15% 
glycerol and frozen using a rate contolled freezing apparatus. 
The bacteria were stored at -70°C until use (referred to as 
"vaccine"). A second preparation grown in the same way was not 
frozen and is referred to as a "fresh" preparation. Prior to 
immunization of animals, the bacteria were resuspended in PBS + 
0.05% Tween 80 to the desired concentration and cup sonicated 
briefly to disperse clumped bacteria. Six week old BALB/c mice 
were inoculated with a single dose of 5 x 10 4 cfu fresh bacteria 
(determined post inoculation) or 1.5 x 10 5 frozen bacteria 
(determined pre-inoculation) by tail scratch (t.s.) injection. 
At 8 weeks post- immunization, splenocytes were harvested from 
animals and CTL activity was measured (described below) . 
CTL activity was determined as follows: 

Lymph node cells (5 x 10 6 /ml) were stimulated in vitro in 10 

ml in upright T25 flasks by co-culture for 5 days with mitomycin 

C-treated P815 cells (5 X 10 6 /ml) that were pulsed with 250 ug/ml 

51 

of peptide P18 for one hour. A 4 hr Cr release assay was 
subsequently performed in triplicate using P815 (matched ) or EL4 
(mismatched) targets with or without pulsing for 1 hour with 250 
ug/ml peptide P18. Various effector target ratios were tested 
using 5000 targets/well. Specific lysis was calculated as 
follows: % specific lysis = 100 x [release by effector cells 
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minus spontaneous release/maximal release minus spontaneous 
release] . 

The results of this assay are given in Figure 44. 

As shown in Figure 44, a CTL response to HIV-1 gp 120 using 
lymph node cells was demonstrated following immunization of mice 
with BCG transformed with pMV361/HIV-l gp 120. 

It is to be understood, however, that the scope of the 
present invention is not to be limited to the specific 
embodiments described above. The invention may be practiced 
other than as particularly described and still be within the 
scope of the accompanying claims. 
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WHAT IS CLAIMED IS: 

1. A method of inducing a CTL response in an animal 
comprising: 

administering to an animal mycobacteria transformed 
with at least one DNA sequence which encodes a protein or peptide 
or fragment or derivative thereof which includes an epitope which 
is recognized by cytotoxic T lymphocytes, said mycobacteria being 
administered in an amount effective to induce a CTL response in 
an animal. 

2. The method of Claim 1 wherein said protein or peptide 
or fragment or derivative thereof includes an epitope which is 
recognized by cytotoxic T lymphocytes induced by an HIV protein 
or fragment or derivative thereof. 

3. The method of Claim 2 wherein said protein or peptide 
or fragment or derivative thereof is an HIV protein or fragment 
or derivative thereof. 

4. The method of Claim 1 wherein the mycobacteria are of 
the species M.bovis -BCG. 

5- A composition for inducing a CTL response in an animal, 
comprising: 

mycobacteria transformed with at least one DNA sequence 
which encodes a protein or peptide or fragment or derivative 
thereof which includes an epitope which is reocognized by 
cytotoxic T lymphocytes, and an acceptable pharmaceutical 
carrier, said mycobacteria being present in an amount effective 
to induce a CTL response in an animal, 

6. The composition of Claim 5 wherein said protein or 
peptide or fragment or derivative thereof includes an epitope 
which is recognized by cytotoxic T lymphocytes induced by an HIV 
protein or fragment or derivative thereof. 

7. The composition of Claim 6 wherein said protein or 
peptide or fragment or derivative thereof is an HIV protein or 
fragment or derivative thereof. 
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8. The composition of Claim 5 wherein said mycobacteria 
are of the species M.bovis -BCG. 
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£ Cn^TAGCAATTXrTCGffiXXH^ 

$ GAATATOGTTACGGAGOTACXX1A CTtXXXXTTGAAAOGCGCGGCXJOGTT 

• L L A SMA SA SKARRAV 
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jf GOCCGCACn^GTGlGGOCrTXX7rAGAOCACGATCCX*GTO 

^- + + ♦ •- + -- .- ..._- + 2100 

2 CGCtXX3*IGAC*rcACACCGGAGCATCTC"GTGCT^ 

R A S LTAEYVVI GDAWI 

ACQTEGRLGRDRGG LE 
G-APCR MUTAGEN OS IS TQ _ 

pV* Vj['0-300S IRocE 
CTCGCTGTCCGGTAGCGGTCCCGGACACAUU'lLt/llGCACG^ 

♦ + • + + 2200 

GA GOGA(^GCCCATOGCCAGCtXCn/l XJ I l^AGCAACGTGCCCTTAAGC 

ESDPLPGPCVDNCPFEA 
R Q G TAT RSVRRQV P IR 

TltXTGCGTCGCCACCTAGATDCGCATGAGGGCCOGACGATAGGC^ 

+ ' ♦ + + -- + 2300 

CAAGGaX^GaXTITX^TCTAGGCXTrAC^^ 

XRTP LYIRMLA PRYAWL 
EPDAPLDAEPRSSLGvy 

crriGTACTiwiLiu:iuAtt*cx*AGa^ 

♦ + + + + 2400 

GAAACATCCAOCAGACGACTGCGGTCXJOGro 
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TXXXTTGCTGCCAGTGGCGATAAGTOGTGTCTra^ GAC 

1501- .- + --- -- + -- + + - * * * * " + ' 

ACCGACGA(XGTCAOCGCTATIX^GCACAGAATGGC(X!AAOT 

CACACAGOCXIAGCTrCGAGCX3AACGACXTACACXXL\ACrcA ACA 

ieoi + + + + + 

CIXTKTTOGGCTXXMACCTCGCTKKrKX^TGTGGCTTG 

AGGTATCOGGTAAGGGGCAGGGIQCGAACAGGAGAGCGCACGAGGGAGCr CD 

TCTGACITGAGCCTrCGAril 1 I t; 1 GATGCTCGTCAGGGCGGCGGAGCCTA • 

1801- + ♦ + + + * — 

AGACTCAACTCGCAGCTAAAAAC^CTACGAGCAGTCXXXXXXXXn U_ 

GC ClTntX ' ll^ CA' lt/niTril CIXXX^ 

1901 + + + + +' X 

CGGAAAACGAGTGTACAAGAAAGGACGCAATAGGGGACTAAGACACCTA'I'I h- 

EMP E.rep BEG) \ Nl M. re P £ 

_iu,i,m i,t,o, N I,0,l,M — 

ACO^GOGCAACCCX7nXXXXXXX^(XX:GTGAG<XCACrAGCrc J 

2001 + ♦ + + (- 

TGGCIXXXX7ITGCGCA(XOCGGCGTGCCX^CTCXXX7IXXrrc < 

ACGGGTCTAAGGCXXrariTrrACGGCaX^ 2 

2101- + + + + + 

TGCCX^GATOXXXXKCACATGC^ 

' TGGGWnXXHCGGCTGTOGCIXXn^^ 

2201 ♦ ♦ ♦ ♦ ♦ 

AOCOCCAOJAG/XX^CAGaJACCACAAGG^ 

TGGAGCTCGTCItXXSAa^TACACCGGTC^ 
2301 + + + + + . 

ACCTCGAGCACAGOCTGGTATGTtXXXTACTAATTACCACCAGATfiATXXITI 

GCCGCTGGCAAGCGAOGATOTXHT^^ 
2A01 - + + + + + - 

CGGCGACCt7nXXXntX7TAGAACGAGCTT:(Xt7rAGATGGOC 

AACCTGCTGGTCCTGGACCTAG ACCATCCAGA CCCAGCGCTCCGA GCGCIt ^ 

2»1 + + + + + . 

ITGGACGACCAGCACCTGCATCTGGTAGGTCIGCGTCXXTGAGGCIXTXrGAG Q 
CCAA(XGCCACGC^CACGCAGTGTGGGCACTCAACGC^ J* 

2601 + + + + U) 

CGTItXrGClXXXntnXXXn^ACACOCGTCAGTItXXX^ — 

AGGOmCGGCGCXXXX7IXX;A'IXXXX;AC<XX:AGriTACirAGG<:CIX"AlX;A 

2701 + ♦ + + + . r\ 

TCCGGAAGCCGCGCGGO\GCTACCGCTGGCGTCAATGAGTCCGGAGTAC \J 

CICTACACACICAGCCACATCGAGCXXX^G — 

2801 + + + + + • [1 

GAGATGrrCTGAGTCGGTGTAGCrOCGGCTXXAGCCGCGCTTGTACGGTGG 

GGCGGAATTGCGCACTCTTCATTOXTCAGGTrG^ 

2901 + + + + + " 

CCGCCITMCGCGTGAG\AGCrAAGGCAGITXAA(^ 

CC^ATCrATGOCXJACTGCCACC^^ 

"vni + + + * 

GCGCrAGATACGGCTT^COX;'iX;CXX^ 

AGCATTTGGOGTrcGATCACAACXrAAGTCGCGCATTTGGG<XGAC^ 

3101 + + + + + 

TCGTAAACCGCAACXTAG"! I/ 1 U<J 1 ' 1 CAGCGCGTAAACCCGCCTGCCCTAG 
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X I ^7 

GATAGTTAOCGGATAAGCXXXIAGCCCTCGGGCTGAACXXXrGGGTTCXTrG 

+ + + + + ieoo 

CTATCAATGGCCTATTCCGCGTCGCCAGCCXX^CT^^ 
GCCTCAGCATTCACAAACOCCCACCCTrCOOCAACKWAGAAACGOCCAC 

+ + + + + 1700 

TCCCACTCGTAACTCITrCGCGGTGCGAAGCGCl UXV lll l l OCGCCTG 
* TCC^GGGGGAAACGCXTTCGTATCTTTATAGTCCIt^^ 
0 ■•■ * + 1800 



12 GGTCCCXXnTIXX^GGACC^TAGAAATATCAGGACAGC(XIAAAGOtX»roG 

TGGAAAAACCKTCAGCAACGCGGCCM'ririACXTITCCIXKXXrillGCrG 

J£ + + + + + 1900 

U_ ccriTrrcOGGTCXnTCCGaXXJAAAAATGOCAAGGACOCG^ 

I CCGTATTACCGCCTTC AGTCA GCTGATACXXXniXXXXJCAGCCGAACG 

I— + + -- + — .-....+........-+ 2000 

> GGCATAAl GGCGGAAACIXL^CnXACTAlXKXTGAGCGGOGTCGGCrrCC 



< 
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U 

h TCGGGCGCTGTCTrGGCTCX7rAC(XGCGCATTCAGGCGGCAGGGGG^ 

+ + + + + 2100 

AGCOCGCGACAC^CXX^GCATGGGCC« 

GAAACGTCCrcGAAACGACGCATXjrcTrcCICCTC^ 

+ • + + + + 2200 

CTTTGCAGG AGCJ Tl tX'fGCGTACACAAGGAGGACCAACX^VTGTCCACCA 
A GCGGGGGAiV I u i \jCAu 1 1 u i uGGGTtXXXJL.Vlt^GCG AAATATCTTIACT 

+ + + --------- + + 2*30 

TCGCCCCCTX^CACGTCAACACCCCACCGCXXJAGT^^ 
GCGTGAGCXaCCTCGCOGAaifWTITO 

+ - - + — + -.. ---+-. ....... + 2K0 

(J) tXX^CTCGGTGCAGCGGCIXKnTAAAClXXriXX^ 
CXntTXXXCrACXXXGCCGGTACATro^ 

GCAGOCGGGATCCXGCGGCCATGTAGCTCCXICITGGGTTC 2330 
VVGCGCOXGGGGTCCCATCCCCT^^ 

•-- + + + + + + 2600 

TCXX:GGGCCC01\GGGTAGGCGACGGGTTGOGCrAGCAO^ 
,CCX>AATACGCGCGGCXnAAGCfXXnTXX 

+ + + + +2700 

TGGCTTATr/TXXXrGCATIXXXXXMGCX^ATGTACaxrGCA 

CCAAAAACCCOICXX^CATCGCCTCG 

+ + + + + 2800 

ITTnTnGGGGOCGGTCTAGCCGACCCTITGCCITACCGAGG 
GCCGCGCTGGCGiraGCAGACCtCGrc^^ 

+ + + + + 2900 

OGGCGCCACOGCAGTCXTICTXXnraVOT 

CCCTCATCCGCATCTACCTGCaiACCCGGAACGTGGACGGACTCG 

i- i- - + -• + + 3000 

GGGAGTACGCCTAGATCGACXXXTITXXXXTnTJ^ 

ACTTITriTTTCCCGGACCGCTAaX^ 

+ + - + + + + 3100 

lXX*CACAC.GGCCTCGa!ATGGOCT^ 

CGTGGTCTArcAGGCCACACrcAGTGCGCGOCAGTCXXXXATC^ 

+ + + --- + +3200 

CACXZAGATGCTOCXXntTTGAGTCACGCGCGGTCAGCCGGTAGAGOGCCT 
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GIXnGGAlGGTCTACGTOCACGACCAGCAGGT^^ 

2301- + + + + 

CAGACCTACCAGATGCAttTGCIXXAXJJlXJUUJiCGGTCGCXlACAAC 
dDPNDVDVVLLNALATP 

ATCCCCrcGAGCAGAlUUl U^CII GOCAGCGCOCAGTACGGCAGOCA 

2801- + ------ - - " + + + 

TAGGGGAGCTOGTCTAGCAGCGAACGGTCGCXXXTrcATOCCX^ 

d GE LLDDS ALPWY PLW 

TAATCAOOGGItnATCXnOCCACACGAOCIGCAAGTCAGATATTTOG 

2S0h + -- + + — + 

d ATIAGTCGOCACATAOCAGGCnrrGCrcCAGGTTCACTCTATAA^ 
TVPTNDSVLE LDSI ES 

TCATGAAACAa^CCCACAGOXAG 
3001- » + ......... + --- + 

dS S VGA V A SCGWGG TG 
CTGTCXXXXXXXrTAC^CGCXJUOJll AGACXXXTITAGACtXXXTIGCOGO 

3101- + + + + 

GAGACCGCOGGCATGTGOCXXXXJAATCrcGG€AATCrGGGGCACGGC 



Clt^CrriGGCrmTCGCGTAOGAATCGGCTUItnt^ CD 

3201- + + + + SI 

iaGlGGACXXUAATACCGCXT«TrAG <0 

O^GGCCGCAaXXXXXXXXXXXnGTCAACAA^^ 
3301- ........ 4. ...... --4---. ---..4.. 4.......... 

ocjocxsoacnGacoGocxxxxxx^ _ 

OCGCOCXUTGAGaXXXXXnTACGCTCXXnXXX^ u. 

3«r +- -.- • + + + • 

GGCGGGCTACTCXXXXXXXLiATCXXIAtXGACGGTCCGCAAGCGOCX^C x 

AGGCXTTOGCCCTITTTAAGGCTGAATITCCrTGTCltXX f- 

3301- + 4- + + 4 ~ 

TCCGGAGCGGGAAAAATTCOIACTTAAAOGAACAGAGGCITAGGTrcAC > 

ACACATGAOCAACriXXJATAACGTIOC^^ x 

3601- ♦ + + + + u ~~ 

TCtTGTACIGGTTG^GCTATIGCAAGAGCOSAGCTAGACGGAGCOCGCAA H 

ACAGCni/lL'l Ul ' A AG<XGATGCOXXIAGCAGACAAGCCCGTXIAGGGCGO < f\ 

3701 + + + + + 2 U 

TGTCGAACACACiTTraCCTACGCXX^ _ . 



GCGATAGOOGACIGTATAClGGCTrAACTATGCXXXIATCAGAGCAGATItn' 
3801- + ♦ 4 4 4. 

C<Xn"ATCGCCrcACATATGACXXJAATTGATACXXXGTAGTCTCGTCTAACA 

AGAAAATACOGCATCAGGOCXnxriTCCGCITCCnXXCrC^ 
3901- ♦ + + + 4 

TCTITTATCGOCTAGTCCGCGAGAACCCCAAGGAGCXJAGTC 

TAATACtXTITATCCACAGAATCAGCGGATAACXXL\GGAAAGAAC^TCTGAG 

4001 + + ♦ ♦ 4 

ATTATCCCAATAGGTCTCTTAGTCC€CT A T 

CTITrnXATAGGCTCCGCCOCCOGTACCAGCA 

4101 + ♦ ♦ -- + + 

CAAAAAGGTATCCGAGCCCCGGGGACTGCTCGTACTGTTT^ 

CCXXCItXJAAGCTCtXTCXrroOGCIXnTXTXjn^ 

4201 + 4 4 4 4 

GGaXIA(XITOGAGGGAGCACGCGAGAGGACAACGCrcCGAOGGOGAAT& 
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3<^TOCXX^XIX^TGTACX^GGCGCXX7^AGGGCCGACGC^XXX}C^ III UGCCTAG 

- + ♦ * + * 2800 

CAACCa^GCTAC*TGGaXCCGGATOCCXXX7^^ 

MAEIYRRGLASARSQRYI 

3AGClGCTCAA AT TOGTa X XX^a?ltX^ 

- + + -- -- + .. + ... — 



■-----+ + + + + 2900 

riaUCGAGTlTAAGCAGC0CXnGCA0CGAGTGO3AA0^ 

L Q E PEDAVMSVS PLLG R N 

TTUGGIXXXACCCCACAACTX^C^ 

*CTttXrGGTOCGGTGTTCACGTTnT^^ 3000 

LPWGVVACEG A R G D L G S 
CAGGAGGAACACATCCGTCCTITCX^GCiA(X7rTTCOGGGCC^ 



3100 



GTTXritrTKnX7rACXKL\GCAAAGCltXnTXL^ 
V L L F V 

CICAATCXXraXXnACGAGCX^CACAGCACa^ 

• + + + + ♦* + 3200 

* rrrtnrYYynrAir.1 -I. XXSI GTCTCGTCGGCTTGAT flTTTYYSAfY'Af'fY? 

M. rep - Mlu 



GAGGTCAGATAOG<aXTACTCA(XXntRjCAAGGGOGACACAGOOGOOOCAC 

+ + + ♦ ♦ - - - + 3300 

CIXXACICTATGCGCJGATGAGTGCGACXXTITXXXXrrc 

ACCAGCAGC'lbTl UMGGCTTGGCnXAAGTGCAGGACATCIGTCGCGAAOG 

♦ + ♦ + +3400 

«ltXnXX7IXX!ACAACCrCOGAA<XGAGCrTCAOGTCCl d'AGCAODGCTTGC 

TKGTCCCAGOGCGTCGMXXXTTAGACCOCCTOCGGIGr^ 

*■ + + - + + +3500 

C^(XAOGTCXXXXAGClTXXXXVT^^ 

CTIGTOCAAGGGTGTATCrACXKTrrAGTCCAAAGTniAAACXfA 

+ + + + +38© 

CGAA(>GGTItXXIACATAGATCOGAAIC4GGTrrCAAG' 1 1 ' HjL'1 LUXTAA 

rcGGTGATGACGGTCAAAACCTCTGACACATGCAGCTCO^ 

+ *' + + 3300 

AGCCACTAClXXXACITITCrAGAatTICT^ 

GTCAGCGGO"! 1/1'J IXXXXK7IX7TCGGGGCCC4GCLA 1 uACOCAGTCACGTA 
-••••■»«• + •••••.••«♦«••••••»«+......... + ........ - +3800 

AGTCGCa^CAACaXXX^CAGCCCCXXXT^^ 



:i,c<i,M 

ACTGAGAGTGCACCATATGCGGTTTni^AATACXrGCACAGATG 

+ -•- + + + +3900 

TGACIOCAGGTCGTATACGCCACACTrrATXXSOCTCnCT 
CGCTCXXTIXAil"ltXGCnXXXX31\GCGGTATCAGCTCA 

♦ + + •- + +4000 

GCGAGGCAGCAAGCCAGAmTTmCX^TGGTCTAGC^ 

21 CAAAAGOCCAGCAAAAGAOCAGGAACCXTrAAAAAGGCCXXXTlGCnXXr 

+ - - ♦ - + + +4 jOQ 

GTrnxrGGTXxrnrTxxxxntXTitx^ 

AAGTCAGAGGTGGCCAAACXXGACAGGACTATAAAGATACCAGGCGTIT 

+ + + + + + 4200 

CriTT^GTCTC^CCGCTTTTXX^ 

CXKL\TACCItnXXXXXnTIXnXXXTItXXXJAA 

+ + -- + + + + 4300 

SaTTATGGACAGGCGGAAAGAGGGMGCanTOX^ 
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TcccceenazicGCGajTGocGCtt 

CGGCTACAGCGACCGCTACAACOGGCAGCCGA 

3301- --• ♦ + + ' + + * 

CCOGATGTCGCIXXXX^TCTrTGGCCXTIXXKXnX^C^ 

CTOGTOO(MXTOGTGGOGOiOGAAOt3CAGCGAGlXX»CTDGOCGAGCAG^ 

3401- + ...+.. . . .. . - - - + 

CAGCAGGCCGAGCACCGOGriCCrrcCGT^ 

GGCCGCAAACGGCXIAAACATniXXXXrrGCATCIXKJA ^ 

3501- - ♦ + " * + ' * " + * + »o 

(XXXXXTITIGCCGGTTTCTAAAGCCCGACGTAGAC^^ 

AAAGGOTACAACGMGCCGACMTCCAaX^^ 0 

3601 + + + ♦ — 

TXTOCGGGTGTTXKTKXKXlXTITAGGTGGCGACMGATrc u - 



I 

CAGGTAAAAGTOCTGGTAGACGCTAU'ririt.'I'Ui*! 1 1GGGCCATGCCT ~ 

570! + + •♦■ + > 

GTCCATTnCAOGACCATCTOCOATCAAAAG^ x 

U 

GGGTTCTACG AAI L' I'l IHJ 1 CO ATACCAAGCCATTTCCGCTGAATATCG < 

3801 « - - + + -........ + »- -- -• + ..... ^ 
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TCCGCGICAGOCATGCATGGAGGCATIXXTATGAGCXxACGGCrACAGOGA 

+ + + --.. + .-. +3300 

GGOJCAGTCGGrrrACXn'ACCItXXTrAACGATACTCXXnTXXX^ 



CGTGACCGCaXXXJAAGGCGCTCXMATCACCXX^^ 

+ + + + +3400 

GCACIXXKXXXXXXnraXXX^GCTrACrcGCCrcATAGGC^ 



TG<^CGCOGCGAACGCATOCGOG^ 

+ + + + + 3330 

£ ACGTGCGGCttnTGCXnAGGCCXXa^TA^ 

O CTCGGCTATCGGGCGAGGAAAGAGCGTGCXiGCAGAACAGGAAGOGGCTCA 

.~ + + + + +3600 

GAGCCCATAGOOCGCTOCTnCKXX^OGOCGTCT^ 

X CGAGCGGGTgKXXXXXXXn^^ 

t • + + + + + +3700 

^ COCTCGCCCACAGGGa:CCCAAGGCACXXXXraW^ 

I 

O GTCTCGTTGCGTGTTTCGTItKrGCCXXTITITCAATA 

- + + + + + •-- +3800 

^ CAGAGCAAGGCACAAAGCAAGGCGGGCAAAACTrATGGTCtX^ 

GGGAGCTCACCGCCAGAATCGGTCGTroTOGTOATGT^ 

- + + + + - + +3900 

TXXTTCGAGTGGCGGTCTrAGGCACCAACACXZACrACATGCA 
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TAAATCTAGATATOCATGGATCCAGCTGCAGMT^^ 
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ATrrAGATCTATAGGTACCTAGGTOGAOGTCTTAAGCTroGAATAGCrACAG 



AAAGGCTCAGTCGAAAGACTGGGCCTTICGTITTATCTGTICT 

+ + + + + +4100 

rrcCGAGTCAGCTTTCTGACCCGGAAAGCAAAATAGACAACAAACAGGCCG 
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FIG. I5B 



90 

AAGATAAAAATATATCATC4TGAACAATAAAACIXnCIGCTTAC4TAA 

+ + + + + 100 

TICrXITTTrAT/ttAttAGrACrrGRTATrrTGACAGACGAAnXttATI 

GAGGOCGaUTTAAATTCCAACATGGATGCIX^mATATGC^ATM 

+ + ♦ + + 200 

CTXXCGOGCTAATrrAAGGriTGTACCTACGACTAAATATACCCATATr 

GGGAAGOCXXIATGOGCCACACITGTITCrGAAACATGGCAAAGGTAGC 
+ + + + + 300 

AOOCTrcGGGGTACOCGGTCrcAACAAAGACTITGTACtX»ITIt^TOC 

TTTATGCCTCITCCGACCATCAAGCATTTTATCCGTACTC^ 
+ + + ♦ + 400 

ATACGGAGAAGGCIXXTTACnTCCrAAAATAGGCATGAAGGACrACTAC 

AGAATATCCIT^TTCAGGTC AAAATATTtHTG ATGCXXTItXXIAu iu 1 1 

+ + + + + 500 

TCTTATACXIACTAAGflTXIACTITrATAACAA 

GCGTA'1'1 ICGTCTCGCIt^GGCGCAATCACX^ATGAATAACGCriTro 

. + + 600 

GfXSCATAAAGCAGAG^^ 
< TCTGCAAAGAAATGCATAAlCrTTTGCCATTCTCAO^^ 

f) .... + + + ..... .j. . .. 4- » » - - - - + 700 

~ GAC ^rriL'l'll ACgrATTAGAAAAOGGTMGAGTCCOCTAAGTCAGC 

to ATAGClTGTATItlATCTrcGACCAGTOGGAATDGCAGAOOGATAOCA 

_ + + + + + 800 

U- TTATCCAACATAACTAC^ACClWTCAGOCrrAGOGTCTCXKTATGGT 

GAAACGGCTTTTTCAAAAATATGGTATTGATAATTC^ 

X + + + ♦ + 900 

t TCTlTCOCGAAAAAGTITITATACCATAACTAmGGACrATACTTAm 

GGTITnAACACTGGCAGAGCATrACGCTGACT^^ 

1 + ♦ ♦ -- + •-- + 1000 

° CCAACATTGTGACCGTCTaTrAATCCGACroAACTGaXTGCCG^ 

k CCGACAACGCAGACanTtXXritXK^AAGCAAAAGTICAAAATCAOC 

2 + + + + +1100 

GGcnmxKxntnxxx^GGCACCGTntxrrrrrcAAC 

GCITXUTGATGGGGCGATK*GG{riGGTAT^ 

..•••A. ..••.-••+■-•--•-•-- + •••••••••4' 

CroACCTACTACCCCGCTAAGTCXXM 

X^TCAAAGGATCTICrTOAGATOCTITrTlTCTGCT ^ 
CTAGTTTOTAGAAGAA<^^ 

GAGCrACCAALMU'llllU^GAAGCTAACIXXXTIX^GCAGAGO^G 

CrcGATGGTTCAGAAAAAGGCITCCATTGACCG^ 
ACTaXTTAGCACCGCCTACATACCTCXXnxnXXTAA ^ 

TCAGACATOGTCGCCG^^ 
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no. ISA 

N 

tt-gln Kan Cmrtt- 

I 

GCTAGCCAACAAAGCCGACGTl G'l G'l L'lLAAAATCTCTGATGTTACATTGCAC 

1 •-- + + - . + .. + +-• - 

CXJATCGGTTCTTCGCTGCAACACAGAGTTTTAGA GACTACAATGTAACCGTG 

ACAGTAATAGAAGGGGTCITAJXiAGOCAT^^ 

101 + + + + +.. 

TGTCATTATGTTtXXXIACAATACIXXXTrATAAGTO 
N 

& 
I 

20! ATGGGCTOGCGATAATGTCGGGCAATCAGGTGCGACAATCTAlXrGTIXTrAT 

TA(XXX^GCGCTATTACAGCOCX7reAGTC 

CrnXTAATGATCmCAGATCAGATtKnCACACTAAA 
301 ♦ - ♦ + + + _ 

CAAOCGTrACTACAATGrTCrACTCrAOCAGTCTGATTrc 

CATGCnACICAOCACWKXiATOOOOGGGAAAACAGCATIXXIAGGTATTAGA 
401- • + + + 

GTAOCAATCAGTGGTGACXXrTAGGGGCCCrrTlXTrCCT 

OCTTXXKXXXnTXK^TTCGATIXr^^ 

50 - + ♦ + + + m 

06ACX3C(KSOCAAO(7rAAGCTAAGGACAAACA1TAACAGGAAAATI¥nYXXT^ *0 

GTTGATCXX^GTCUTITTG^TGACCAGOCTAATGG ~ 
60h ••••••• • +• - ♦ 

CAACTACCCrCACTAAAACTACIGCTOGC^^ 0 

AGTCAGTACCACTAAAGAGTGAACTATIXKIAATAAA^ x 
GGATCTYGCCATCCTATCGAAC1 1 U^GIGAG 1 1 nrTCdTCAITACA t 

CCTAGAACGGTAGGATACCTrGACGOAGCXACTtZAAAAGAGGA^ ^ 

< 

TTGTTGAATAAATCGAACJ TH GCTG AGTTCAAGGATCAG ATXTACCCATCTIX: 5 
1001* • • - ♦•- + + + + 

AACAACTTATTTAGCTTGAAAACGACIGAACTTCCrAGT^ 

AACIXXn^OCAOCTACAACAAAGCTCTCATCAACOCnXXXTItXXTrc 
1101 — + + + + 

KAN CASSETTE^ S BEGIN E.RAP 

CCItaCCAGGCAGACCrcACTAGTTCCACT^ 

1201 --- + + - + ♦ 4... 

G^GTGCItXXTTCTGGAGTGATCAAGGTGACTCGCAGTCrc 

CTIGCAAACAAAAAAACCACCGCTACCAGCGG1 GG'l I'l G' iTl GCCGGATCAA 

1301 + + + + + 

GAACGTnGTITITrrCXrrGGCGATGGTCGCCACO^ 

ATACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTAGG<XACCACTTCAAGA 
1401 + + + + + .. 

TATGGTmTGACAGOIAAGATC^CATCGGCATCAATCCGGTGGTGMGT^ 
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FIG. 22B 

CAGCTCGCTC^GAGCarrGAArcACAGGGOGAACXXXAGCO^ 
+ + + + + 100 

GGTCGAGaJACTCTCGGCACITGCTCro^ 
< ITrrciTCACIXXTACXAGCnX^T 

C\J ---•---- + -...-.-.- + • • - - - + 200 

Cg GAGAAGTCACGTCGTCGAGGTTAGAOCACACITACCGGGAGCAGACAAG 
GGGAIXXXnTGCAAtXXXXTrATGCCCAGGra 

0 + + + + + 300 

U. TCCCTACGCAA(XnTGGCGCATACGGGlXXAGIUnLlCAGCGlGTIXnYr 

1 C^GCCGAGCGGGre^ 

> atl P core 

X UTCGGCIGCATCCTCTAAGTGGAAAGAAATTCCAGGrrc 

O + + - * + + + 500 

H AGAGCCGACGTAGGAGATTG^CCrriUll'lAAOGTCXL\GCAlCllCGOGC 

5 GAGAGGAGACCTAGTTGGCAA(XTOGCGGATGGGGATCGC^ 

+ + + + - - + 600 

GCIXnrCIXnGGATCAACCGTGCAGCGCCTACXXXTAGOGACTIXnGA 

Pa a 

I 

GGTACTAGGCGCTGCAGAOCTAOGACAAGAAG4XCGACGCCGAA>GCCrG 

♦ • + + + 700 

CCATGATGCGCGACGTCTXXSATGC'lUl'l^ 

YY AL QT YD NKMDA E A W 

Inl start? 

accgggcgaagaaggcagccgccacx:gccatcacgctggaggagtac 

+ + + + + 800 

CClTXKXCGCITCTrcCGTCXXXXXT^ 
D RAKKAAA S A IT LEE Y 
TACAGCGGGCACGCGGAGCGCCGCATCTACCCCGTGCTAGGTCAAGTGG oftrt 

+ + + + + 900 

ATGTCGCXXXnXXXKTCTCGCXiXXXnAGATGGGCCAOGATCCACTTX^rc 
YSGIIAERRIYPVLGEVA 
GTAC,GAACX^CCCGACTGCCrGCCCXX:ATr,CCTAC^ 

+ + + + + 1000 

ATCCTrcGTGGGCTCACGGGCGGCCCTACGGATCnTGCAGG 
RKHPTA RRHA YNVLRA 

ATCGAGCAGAAGGCAGCOGATCAGCGCGAOGTAGAGGCGCTGAOGCCr 
+ + + + + 1100 

TAGCTCGTCTrcCGTCGGCTACTCGCGCTGCATCrcCGCGACrGOGGA 
I EQKAA DER D V E A LTP 

CATACATCCTGGCGTGGACGAGCXrrCCXMTItXX*AGAGCTGATCGAGC 
+ + + + + 1200 

GTATGTAGGACCGCACCIXXrrttKJAGGCCAAGGCTCTCGACrAGXnX^ 
YILAWTSLRFGELIEL 

G(XGTGGCGCrTCCOGCGTGGGGAACAAGA'l LV1 UG I ' 1 GGCAACGCCAA 

+ + + + + 1300 

CGGCACCGCG AA GGGCGCA CCCC 11 G 1 1 L" 1 AGCAGCAACCGTTGCGGTT 

RGAS RVGNKIVVGN AK 
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Fl O. 22A 



j GTCGACCACCAAGGGCACCATCICrGCTIXXXKXACCCCGTIWKXGGlGC 

CACCIXXnXKnTCCOCTGGTAGAGAOGAACCOGGTGGGGCAACCGG^ 
l0J GCGAGGGTTCCGACXrGCTGCAACrcCCGGTCCAACCTTGirCCGGTCTAT 

CGCTCa^GGCTtXXXJAOTITCAGGGCC^^ CO 

GCGCAGGCGGGGGGCIUTATl VLSI liUlCAGCATCGAAAGTAGOCAGATCA ^ 
201 + + + + + 

CGCGTCCGCCCCC(X»AGATAAGCAAACAGTCGTAGCTITCATCGGTCTAG O 

301 ?3^ GACCCC ^^ GA A^' 1 ^ ^ 

AA(XTCTCG<K5ACcinU x 

Nde I i 

i - 

401 TGGGTGIUlGOCGACCACATATGGGCGGGTCAAGATAGGTrnTACCGCCT £ 

ACCACAGACGGCIGGTGTATACOCGGCCAGTTCTATCCAAAAAATGGGG^ X 

501 TTCAAGGCTITIAGAGTTGCAG\GGAGT1TXIAAQ^^ K 

+ + + + + . < 

AACTTCGGACTCTCAACGTGTCCrcAACGTrcGG 5 

Bamlll 
I 

^ AG0GCAGOGGGAGGATCCAA0CCICATA0GTCAAC0CGCAGGACCt?rfnWA 

TCGCGTCG<XCTCCTAGGTrcGGAGTATGCAGTItXXKJGTCCTC 
Inl V R 

Int sUrl? 

CGCGGGCGAGAAGCGGCTCATCGAGATGGAGACCTGGACCCCTCCACAGG 
701 + + + + + . 

CGAGCGCCOGCTCTTCXXXrGAGTAGClXn'ACCIXrrCGACCTGGGGAGGTtTr 
Int LAGEKRLIEMETWT PPQ 

ACCCGGAAGTGGCTCGTGGAGCXX!GACCTCGCAGACGGCACCAGGGATCrG 
801 + + + + + . 

TGGGCCTIXj^CCGAGCAOCTCXXXXntXJAGCG^ 

TRKWLVERDLADGTRDI, 
CGGTCACAGAGATGACXXX^GCTXnt X TroCGTXXG^ 

*>1 + 4- + + + 

GCCAGTGTCTCTACTGOGGTOGAGACCACGCACGCACCAC(XrGGCCCTACCC 
VTEMTPA LVRAWWAGMG 

GGTGATGAACACAGCGGTXXIAGGACAAGCTGATCXXrAGAGAAfXOGTGCCGG 
iOOl + + + + + _ . 

CCACTACITGTGTCGCCAGCIXXTXTITCGACTAGOGTCIXTITGGG 
VMNTA VEDK LIAENP CR 

Bglll 
I 

GAGGAGCTGGACATCGTCGOCGCTCAGATCTTtXAGCACTAOCGGATCGCGG 

1101 + + + + + 

CTCCIXXJACCTGTAGCAGCXXXXJACTCTAGAAGCTCGTC 
EELDIVAAEIFEIIYRIAA 

TTCGOCGCAA GGACATCGTGGACG ACGGCATG ACG ATGAAGCTCCtKKITGC 

1201 + + + + + . 

AAGCGGCGTTCCTGTAGCA(XTGCTGCCGTACTGCTACTrcGAGG<XCA^ 
RRKDIVDDGM T M K I. R V R 
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